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On the Base Station Selection and Base Station 
Sharing in Self-Configuring Networks 

S.M. Perlaza, E.V. Belmega, S. Lasaulce, and M. Debbah, 

Abstract 

We model the interaction of several radio devices aiming to obtain wireless connectivity by using a set of base 
stations (BS) as a non-cooperative game. Each radio device aims to maximize its own spectral efficiency (SE) in 
two different scenarios: First, we let each player to use a unique BS (BS selection) and second, we let them to 
simultaneously use several BSs (BS Sharing). In both cases, we show that the resulting game is an exact potential 
game. We found that the BS selection game posses multiple Nash equilibria (NE) while the BS sharing game posses 
a unique one. We provide fully decentralized algorithms which always converge to a NE in both games. We analyze 
the price of anarchy and the price of stability for the case of BS selection. Finally, we observed that depending on 
the number of transmitters, the BS selection technique might provide a better global performance (network spectral 
efficiency) than BS sharing, which suggests the existence of a Braess type paradox. 

I. Introduction 

In this paper, we consider the case where several radio devices aim to obtain wireless connectivity by using 
several base stations (BS). Here, each device must strategically determine the set of BSs to use, as well as the 
corresponding power level allocated to each BS to maximize its own spectral efficiency in bps/Hz. In this context, 
we consider two different scenarios. First, we let each player to use a unique BS (BS selection) and second, we let 
them to simultaneously use several BSs (BS Sharing). 

Note that if only one BS is considered, our model simplifies to a multiple access channel (MAC). Here, when 
all transmitters access the BS using the same carrier, each device uses its maximum transmit power. When, the BS 
is accessible through out several carriers, each transmitter uses a water-filling power allocation (PA) considering 
the observed multiple access interference as background noise at each carrier [1], In the first case, such solution is 
Pareto optimal, if and only if the sum of the achieved Shannon rates lies in the convex hulk of the capacity region 
of the corresponding MAC [2]. Generally, this condition may require certain coordination between the transmitters, 

Samir Medina Perlaza is with Orange Labs, France Telecom R&D. 38 — 40, rue du General Leclerc, 92794, Issy les Moulineaux, cedex 9. 
France, (samir.medinaperlaza@orange-ftgroup.com) 

Elena- Veronica Belmega and Samson Lasaulce are with LSS (CNRS-SUPELEC-Paris Sud). 3, rue Joliot-Curie, 91192, Gif-sur-Yvette, cedex. 
France, ({belmega, lasaulce} @lss. supelec.fr) 

M. Debbah is with Alcatel-Lucent Chair in Flexible Radio at SUPELEC. 3 rue Joliot-Curie, 91192, Gif-sur-Yvette, cedex. France, 
(merouane. debbah® supelec.fr) 



August 12, 2009 



DRAFT 



2 



which can be achieved by using pricing methods [3] . Conversely, in the second case, the solution is always Pareto 
optimal [1]. In a more general context, when there exists several BSs and regardless of the performance metric, the 
model remains being a subject of intensive research [4], [5], [6], [7], [8], [9]. 

Up to the knowledge of the authors, the state of art of the BS sharing and BS selection scenarios is described by 
the following contributions: [10], [4], [5], [8]. In [10], the BS selection problem is investigated by considering that 
each node is characterized by a fixed single user spectral efficiency. Here, the authors showed that based on the 
scheme of exponential learning, players converge to an evolutionarily stable equilibrium. Additionally, the authors 
showed that the price of anarchy of such a game is unaffected by disparities in the nodes' characteristics. In [8], 
the authors studied the BS selection scenario assuming that the transmitters aim to minimize their transmit power 
level required to achieve a target signal to interference plus noise ratio (SINR). Here, the interaction between the 
radio devices is modeled as an atomic and non-atomic potential game [11] to study the existence, uniqueness and 
efficiency of the NE. Other contributions using potential games for radio resource allocation are [12], [13], [14], 
[15], [16]. In [5], the non-atomic extension of the BS selection game and the atomic extension of the BS sharing 
game are investigated. Therein, the performance metric is the Shannon rate and channel realizations are considered 
identical for all transmitters. Regardless of the possibly unrealistic assumption, the authors of [5] identified the 
existence of at least one NE in the non-atomic BS selection game and the existence of a unique equilibrium in 
the atomic BS sharing game. In [4], the authors study the BS selection and sharing scenarios when the number 
of receivers is equal to the number of transmitters and the performance metric is their overall SINR, i.e., the sum 
of the SINRs obtained at each BS. In this context, it is showed that when all players observed the same channel 
realization (as in [5]), restricting each player to choose only one BS produces a socially optimal NE. Conversely, 
when the players are left free to share their powers among several BSs, their utilities are strongly decreased. This 
effect is known as the Braess Paradox in the frame of congestion games [17]. 

In this paper, we tackle the BS selection and BS Sharing scenarios by modeling them as potential games. Contrary 
to previous works [8], we consider as performance metric the spectral efficiency of each player and we let the channel 
realization for each transmitter to be independently drawn from a given probability distribution. In the former case, 
we study both the atomic and non-atomic extensions of the game. In the atomic case, we show the existence of 
multiple NE and we use the best response dynamics to provide fully distributed algorithms to achieve a NE. This 
algorithm is proved to converge independently of the channel realization and the bandwidth allocated to each BS. 
We measure the price of anarchy of this solution and we observed that the performance of the fully decentralized 
solution (self-configured network) is close to that one obtained when there exists a central controller (optimally 
configured networks). In the non-atomic extension, we provide the optimal fractions of transmitter which must join 
each BS depending on their available bandwidths. Regarding, the BS sharing game, we show the existence of a 
unique NE. As in the previous case, we provide a fully decentralized algorithm which allows achieving a NE with 
probability one. Finally, we compare both scenarios and identify that BS selection performs better than BS sharing 
when there exists almost the same number or more transmitters than BSs. As identified in [4], this observation 
constitutes a Braess type paradox, which implies that increasing the space of strategies of each player, i.e., the 
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number of BSs each player can use, ends up degenerating the global performance of the network. 

II. System Model 

Notation: In the sequel, matrices, vectors and scalars are respectively denoted by boldface upper case symbols, 
boldface lower case symbols, and italic lower case symbols. The transpose and Hermitian transpose of a vector x 
(matrix X) is denoted by x T and x H (resp. X T and X H ). The sets of natural and real numbers are denoted by 
IN and R, respectively. Finite sets of natural numbers are denoted by calligraphic upper case letters. Given two sets 
denoted by A and B, their Cartesian product is denoted by Ax B. The cardinality of set A is denoted by \A\. The 
5-dimensional vectors e s , for all s £ {1, . . . , S} and S £ IN, denotes a vector with zeros in all its entries except 
the s-th entry which is unitary. The operator [x} + , with x £ 1R, represents the operation max(0,a;). 

Consider a set K. = {1, . . . , K} of transmitters and a set of S = {1, . . . , S} receivers, e.g., base stations (BS) 
or access points (AP). Each transmitter can access the network by using a (non-empty) set of BSs. Each BS 
operates in a specific frequency band and we neglect any type of interference due to the adjacent bands (adjacent 
channel interference). We denote the bandwidth associated with BS s £ S by B s and the total network bandwidth 
by B = X]f=i Bs- Each transmitter sends private messages only to its corresponding BSs and it does not exist 
any kind of information exchange between transmitters neither before nor during the whole transmission. Both 
transmitters and BSs are equipped with single antennas. Transmitter fc £ IC is able to simultaneously transmit to 
all the BSs subject to a power constraint, 

s 

Vfc £ IC, ^Pfe, s <Pfe .max • (i) 
s=l 

where pk,s and pk, max respectively denote the transmit power dedicated to BS s by transmitter fc and its maximum 
total transmit power. Without any loss of generality, we assume that all transmitters are limited by the same maximum 
transmit power level, i.e., Vfc £ IC, pt.max = Pmax- 

For all (fc, s) £ IC x S, we denote the channel coefficients between transmitter fc and BS s by hk, s - Each channel 
coefficient hk. s is a realization of a circularly symmetric complex Gaussian random variable h with zero mean 
and unit variance. We consider a slow fading channel, so that all channel realizations remain constant during the 
transmission time. The baseband received signals sampled at symbol rate at BS s, denoted by y s , can be written 
as a vector y = (yi, . . . ,ys) > such that 

Vs £ S, y s = h s x^ + w. (2) 

Here, for all (fc, s) £ IC x S, the if -dimensional vector h s = (/ii )S , . . . , h,K, s )- The if-dimensional vector x s = 
(xi tS , . . . ,xk, s ), and Xk, s represents the symbol sent by transmitter fc to BS s. The power allocation vector of 



represents the power transmitted toward BS 



transmitter fc is the vector (pk.ii ■ ■ ■ tPk,s), an d Pk.s = E 
s by player fc. The 5-dimensional vector w = (wi, . . . ,ws), with w s ~ AT (0, of) represents the noise at the 
receivers. Here, a 2 s = No B s , where iVo denotes the noise spectral density. 
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The SINR of transmitter fc at BS s is denoted by jk.s and V(fc, s) £ AC x S, 

Pk,sfjk.s 

Ik.s = — : , (3) 

where, (k,s — + J]] Pj.s9j,s represents the noise plus multiple access interference (MAI) undergone by player 

jeK.\k 

k at BS s and <j% jS = |/ife. s | 2 represents the channel gains. We denote by IC S the set of transmitters using the BS 
s. Then, we define two different scenarios depending on the conditions over the sets JC S for all s £ S. In the 
first scenario, named BS selection, each transmitter uses a unique BS. Thus, for all s £ S, the sets AC S such that 
\K a \ > form a partition of the set AC, i.e., V(j, k) £ S 2 and j ^ fc, AC, n AC fe = and ACi U . . . U AC S = AC. In the 
second scenario, named BS Selection, a given transmitter is allowed to simultaneously use several BSs. Thus, for 
all s £ S, the sets JC S form a cover of the set AC, i.e., Vs £ S, JC S C AC. In the following two sections, we study 
both scenarios. Later, we compare their performance by simulation results. 

III. Base Station Selection Games 

Assume that each transmitter can be modeled as a rational selfish player and that such an assumption is common 
knowledge among all players. Then, the BS selection scenario can be modeled by a non-cooperative game Q\ 
described by the tuple (AC, {Vk) k£K , (uk)keK.)- Here, the set of transmitters K. is the set of players. The strategy 
of a given player fc G AC is its PA scheme, i.e., the S'-dimensional PA vector p k = (pk,i, ■ ■ ■ ,Pk,s) £ ^fe. where Vk 
is the set of all actions of player k (strategy set). Since each player only transmits to a unique BS, its strategy set 
is defined as a finite set Vk, 

Vk = {pk e s : p k £ [0,p max ] , Vs £ S, e s = (e fl) i, . . . , e a ,s) 

and Vr £ S \ s, e s , r = 0, and e s . s = 1} . (4) 

Then, a strategy profile of the game is a super vector 

p = (px, ■ ■ -,p K ) e v, 

where V is a finite set obtained from the Cartesian product of the strategy sets Vk, for all k £ AC, i.e., V = 
V\ x ... x Vk- Let us denote by p k any vector in the finite set V-k = V\ x . . . x T^-i x "Pfc+i x . . . x "P^- For 
a given fc £ AC, the vector denoted by p_ fc represents the strategies adopted by all the players other than player k. 
The utility function for player fc, is defined as uu : "P — > H+ and measures the satisfaction of player fc with respect 
to its chosen strategy [18]. In this study, we define the utility function for all players as their spectral efficiency, 
i.e., the ratio between their Shannon transmission rate and the available bandwidth B: 

MPkiP-k,) = J2 "ET log 2 ( X + 7fc,s) > ( 5 ) 

where jh iS is given by (Eq. 3) and p E V. 

In the sequel, we consider a finite number of transmitters (players) such that each player is concerned with the 
strategy (BS selection and transmit power allocation) adopted by all the other players due to mutual interference. 
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We name this model: atomic BS selection game. In the second part, we consider a high number of players such 
that each of them is indifferent to the strategy adopted by every single player. In this case, each player is rather 
concerned with the fraction of players adopting the same strategy. We name this model non-atomic BS selection 
games. 

A. Atomic BS Selection Games 

In the atomic extension of the BS selection game Gi, our interest is to find a strategy profile p* G V such that 
no player is interested in changing its own strategy. Once the network configuration p* is reached, any unilateral 
deviation of a given player decreases its own utility. A network configuration p* is known as a Nash equilibrium 
[19]. 

Definition 1 (Nash Equilibrium): In the strategic game Q\, a strategy profile p G "P is an NE if it satisfies, for 
all k G K, and for all p' k G Vk, that 

Uk(Pk,P-k) > u>k(p'k,P-k)- (6) 

In the following, we analyze the existence, multiplicity and determination of such strategy profiles. 

1 ) Existence of at least one NE: Our first step toward identifying the strategy profiles leading to a NE is to 
prove that there exists at least one NE for any specific number of transmitters and BSs regardless of the channel 
realizations. There exist several methodologies for proving this [20]. In our case, we first show that the game Q\ 
is a potential game (PG). 

Definition 2 (Exact Potential Game): Any strategic game Q defined by the tuple (/C, (Vk)keic > ( u k)keic) * s an 
exact potential game (PG) if there exists a function <f> (p) for all p G V such that for all players k G K, and for all 
p' k G Vk, it holds that 

Uk(p k ,P-k) - u k (Pk,P-k) = 4>(P) ~ (7) 
where p' = (p ls . . . J p fc _ 1 ,pj b ,p fc+1 , • • • ,P K )- 

Def. 2 together with Eq. (5) allow us to write the following proposition: 

Proposition 3: The strategic game Gi is an exact potential game with potential function 

s£5 V fc=l / 

Since the BS selection game Q\ is a PG (Prop. 3), the following proposition (Prop. 4) is an immediate consequence 
of Corollary 2.2 in [11]. 

Proposition 4 (Existence of the NE): The BS selection game Q\ always has at least one NE in pure strategies. 

2) Multiplicity of the NE: Once we have ensured the existence of at least one NE, we determine whether there 
exists a unique NE or several NE. As a first step, we show that rational players always transmit at the maximum 
power level p max : 

Proposition 5: In the BS selection game Qi, all players will always transmit at the maximum power independently 
of the channel chosen to transmit. 
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Proof: The utility function of player k E K. transmitting to a given BS s G S is u k (pk e s ,P- k ) = 1°8'2 ^1 + g^'" 
Then, since the logarithmic function is an increasing function, we have that V(fc, s) G K x 5, and Vp^ G [0,p max ]> 
u k (p k e S7 p_ k ) = log 2 (l + ^If^) < u k (p max e s ,p_ k ) = log 2 (l + Pm ^ g 3 fc,a )■ Hence, rational players will always 
use their maximum transmit power level. ■ 
Prop. 5 shows that the strategy set in (4) can be re-defined as follows 

Vk = Kax e s : Vs G S, e s = (e S) i, . . . , e SjS ) 

and Vr E S \ s, e SiT = 0, and e SjS = 1} . (9) 

The re-definition of the strategy sets Vk in Eq. (4) allows us to study the multiplicity of the NE by using basic 
elements of graph theory. First, let us index the elements of the strategy set V by using the setX= {l,...,S K } 
such that they are ordered following the index i G X. Denote by pW the z-th element of the strategy set V. Let 
us write each vector pW with i G X, as a vector pW = \ ■ ■ ■ >Pjc^' where for all j G /C, G Vj. Second, 
consider that each of the strategy profiles pW with i G I is represented by a vertex Uj in a given non-directed graph 
G. Each vertex is adjacent to the K(S — 1) vertices representing the strategy profiles obtained by letting only one 
player to change its own strategy. Let us denote by Vj the set of indices of the adjacent vertices of vertex V{. More 
precisely, the graph G can be defined by the tuple G = (V, A), where the set V = {vi, . . . ,fs R } contains the 
S K possible strategy profiles of the game and A is a symmetric matrix (adjacency matrix of G) with dimensions 
S K x S K and entries defined as follows V(i,j) G I 2 and i ^ j, 

1 if i G V , 

(10) 

otherwise , 

and dij = for all i G I. In the non-directed graph G, we define the distance between vertices Vi and Vj, for all 
(i, j) G X 2 as the length of the shortest path between Vi and Vj. Considering the structure of G, a more precise 
definition can be formulated for the shortest path, 

Definition 6: [Shortest Path] The distance (shortest path) between vertices Vi and Vj, with G X 2 in a given 
non-directed graph G = (V, A), denoted by dij(G) G IN is: 

K 

d i , j (G) = d j , i (G) = J2lr w U n. (11) 

Note that the non-directed graph G satisfies the property: G X 2 , with i ^ j, 1 ^ dij(G) ^ X. Thus, 

for a specific number S of BSs and K transmitters, the maximum number of NE which can be observed is obtained 
as follows: 

Proposition 7: In a given BS selection game Q\ where the condition 

V(i,j) G X, with i^j, (p«) ^ (p«) (12) 

always holds, the maximum number of NE which can be observed is S K ~ 1 . 

Proof: Assume that a given strategy profile pW (vertex i>j) with i G X is a NE (Prop. 4). Then, given the 
condition (12) it follows that none of the vertices in the set Vi is a NE. Hence, two NE vertices must be separated 
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by a minimum distance two in the non-directed graph G = (V, A). Thus, we obtain the maximum number of NE 
by calculating the maximum number of vertices mutually separated by minimum distance two in G. Given any 
two vertices and Vj, for all G I 2 with i ^ j we have that dij(G) ^ 1. Then, the vertex vi and all the 

vertices Vj such that j G = |n el \ {i} : pjj™^ ^ Pfc'' j, for any k G /C, are separated by minimum distance 
dij(G) ^ 2. Then, for any (i, fc) elx /C, the set Ji.^ has cardinality \Ji : k\ = S K ^ 1 — 1. Then, the total number of 
points mutually separated by minimum distance 2 (including the reference vertex vi) is \Ji.k \ + 1 = S K ~ 1 , which 
completes the proof. ■ 
3) Determination of the NE: To evaluate the number of NE of the game Q\ for a specific set of channel gains, 
we use an oriented graph G = (v, A^j , where the adjacency matrix A is a non-symmetric square matrix whose 
entries are V(i, j) <E T 2 and i ^ j, 

1 if ie V 3 and cj> (p^) > (p«) 

(13) 

otherwise , 
and ai,i = for all i £l. 

In the graph G, we say that a given vertex Vi is adjacent to vertex Vj, if and only if (t>{Pi) > <f>(Pj) and 
dij(G) = 1. Note that the condition for adjacency in G represents the rationality assumption of players: A player 
changes its strategy if the new strategy brings a higher utility function, i.e., increases the potential function. In Fig. 
1, we show an example of the non-directed G and oriented G graphs for the case where K = 3 and S = 2. 

From the definition of the matrix A, we have that a necessary and sufficient condition for a vertex v,i to represent 
a NE strategy profile is to have a null out-degree: deg + (t>i) = (sink vertex), in the oriented graph G. Hence, 
obtaining the number of NE in the game Q\ boils down to identifying all the sinks in the oriented graph G. For 
doing so, it suffices to identify the indices of the rows of matrix A containing only zeros. If the i-th row of matrix 
A is a null vector, then the strategy profile pW i s a NE. However, building the matrix A requires complete CSI, 
since it is necessary to determine whether <$>{p^) > (f>(p^), 4>(p^) = 4>(p^) or 0(pw) < 4>{p^) for all i € X 
and j E V. L . 

To determine a strategy profile leading to a NE, in a distributed fashion with a less restrictive CSI at each radio 
device, we introduce the following definition: 

Definition 8 (Random Walks): A walk through an oriented graph G is an ordered list of vertices v^, . . . ,Vi N 
such that vertex Vi n+1 is adjacent to vertex Vi n , with i„ el for all n G {1, . . . , N}, and N ^ S K . We say that a 
walk is random if given a vertex Vi n , the vertex Vi n+1 is chosen randomly from the set Vi n . 
From Def. 8, we have the following result: 

Proposition 9: Any random walk in the oriented graph G ends in a vertex representing a NE. 

Proof: Each step of the walk, i.e. the transition from vertex Vi n to Uj n+1 , can be interpreted as changing from 
one strategy profile p( %a \ i n e I to another strategy profile p^ in + 1 \ i n+1 £ Vi n such that 0(p(' n ') < 
Since there exists a finite number of vertices in the graph, it turns out that any sequence 4>{p^) < 4>(p^) < 
. . . < 4>{p( lN ^), with i n G T for all n G {1, . . . , A^}, is finite, i.e., N ^ S K . Moreover, the walk is ended if and 
only if the vertex vn does not have any adjacent node, i.e., vertex Vn is a sink vertex. From the definition of the 
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adjacency matrix A in Eq. (13) it follows that any sink vertex represents a NE (Def. 1). Thus, any random walk in 
the oriented graph G ends in a NE. This completes the proof. ■ 
In practical terms, to perform a walk through the oriented graph G implies imposing certain rules on each transmitter 
of a given self-configuring network: (a) A given player changes its strategy if and only if the potential function can 
be strictly increased, (b) Two or more players do not change their strategy simultaneously, (c) All players have the 
same chances to update their strategies. The first condition derives from the fact that each player aims to maximize 
its own utility function. The second condition is to ensure that each step in the random walk is equivalent to going 
from a given vertex to one of its adjacent vertices. The third condition is to ensure a random walk, i.e., to ensure 
that each step is done with the same probability to any of the adjacent vertices. The last two conditions might 
require certain synchronization system among the transmitters. 

Algorithm 1 Base Station Selection Algorithm 
Require: Mk e IC, 

MAI Vector: C fc (0) = (C M (0), . . . ,C fc ,s(0)) 

Channel Realizations: g k = (gk,i, ■ ■ ■ ,9k,s), Vfc 6 K 

t <- 0. 

repeat 

t <- t + 1 

for k = 1 to K do 

s <- argmax log 2 (p max g k , s + - 1)) 

ies 

Phi*) <~ Pma X e s 

C fc (<) <- Cfc(*" 1) + (Pfc(*) - Pk(t - l))9fe 
until p(t) = p(t - 1) 



Note that if the algorithm is implemented in a distributed way, each player k e IC requires the knowledge of 

two parameters. First, the MAI level at each BS, i.e., the vector £ = (&,..., Cs)> where ( s = a 2 s + Pk,s9k,s 

keic 

and which is common to all the players. Second, the channel realization with respect to each BS, i.e., the vector 
9k = (ff&.ij • ■ ■ '9k,s)- Each element of the vector £ is obtained by feedback from the corresponding BS at a 
frequency higher than the reciprocal of the channel coherence time. Each element of the vector g k must be estimated 
by transmitter k using channel estimation techniques, e.g., combining channel reciprocity and training sequences 
in a two-way link. An important remark regarding the proposed algorithm is that the NE where a given walk ends, 
mainly depends on the starting vertex in the graph G and the order we let each player to update its strategy. Thus, 
if each player is randomly chosen for updating at a given point of time, it is not possible to predict the NE where 
a walk ends. Hence, this might lead us to the situation where the convergence point is a non-optimal NE regarding 
a global metric, e.g., the network spectral efficiency. We analyze the optimality issues in Sec. III-C. In Fig. 2 we 
show a walk through the directed graph G of a given BS selection game with K = 5 and S — 2 and a given set of 
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channel realizations. The potential obtained at each possible strategy profile pW, i.e., 4> (p(*') with i G X is plotted 
in Fig. 3 as a function of their index i. In Fig. 2, it can be seen how different walks end in different NE. 



B. Non-Atomic Base Station Selection Games 

In the non-atomic BS selection game, we consider that there exists a large number of players, such that players 
are indifferent to the strategy adopted by any single player. Each player is rather concerned with simultaneous 
deviations of fractions of the total number of players. Let us denote by x s the fraction of players transmitting to 
BS s, and assume that 

Vs G S k , x s = %i 

(14) 



1. 



We denote the ratio between the available total bandwidth B and the total number of transmitters K by a = 
Thus, the ratio between the available bandwidth at BS s and K, denoted by a s = satisfies 5Z s =i a s = Q - 
Using, these notations, the potential function <j> can be written as follows 

K 



<I>(P) = l0g 2 ( + £ Pk,s9k,a ) 

ses \ fe=i / 

= £ £f l0g 2 ( + ^ max £ 9k,s j 

ses V keic a J 



s£5 



B 



K N a s + p n 



\IC S \ K 
K 1/C.J 



V 



£ 



sG5 



B 



(15) 



Note that the term — - \og 2 (K) does not depend on the strategy of the players. Thus, following Lemma 2.7 



in [11], the function 



ses 



B 



(P) = £ l0g 2 ( ^0 « S + (z S Pmax) T^T X! 9k,s j 



(16) 



can be considered as another exact potential function of the BS selection game Q\. Now, we assume that the number 
of players grows toward infinity at the same rate that the bandwidth available at each BS, i.e., 



B 



oo and K 
B 



• lim — = a < oo, and 

B.K^oo K 

B s 

• Vs G S, lim — = a s < oo. 

B s ,K->oo K 

From a practical point of view, when the number of transmitters grows toward infinity while the total bandwidth 
or number of BSs remain constant, the MAI becomes a dominant parameter and thus, independently of the strategy 
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adopted by each player, their own utility function tends to zero. Thus, no quality of service can be guaranteed, 
for instance, in terms of minimum transmission rates. For avoiding such a situation, we have considered that the 
number of players grows to infinity at the same rate as the total bandwidth. This ensures that the utility function 
of each player does not tend to zero when the load (number of transmitters per BS) of the network is increased. 
Under these conditions, it holds that for all s G S, l/CJ — > oo, and thus: 



(17) 

where F g is the cumulative probability function associated with the probability density function f g of the random 
variable g (channel gains) described in Sec. II: dF g (X) = f(X)d\. 

This result allows us to write the function as a function of the fractions x%, . . . , xg, 



4>(x!, . . . , x s ) = V" — log 2 (iV a s + x s p max SY) , (18) 
* — ' a 

ses 

and thus, finding a set of fractions such that no player is interested to modify, i.e. a NE in the non-atomic extension 
of the game Q\ boils down to solve the following optimization problem (OP) [21], 

a* 



max V" — log 2 (N a s + x s p max fl) 



xi,...,xse.R+ — ' ot 
se5 



s (19) 



s.t. £ Xi = 1 and Vi G S, Xi ^ 0, 

i=l 



which has a unique solution of the form 



V.s G S, x s = -£. (20) 
B 



In Fig. 4, we show the fractions x s , with s G S, obtained by Monte-Carlo simulations and using Eq. (20) for a 
network with 5 = 6 BSs and K = 100 transmitters. Therein, it becomes evident that Eq. (20) is a precise estimation 
of the outcome of the non-atomic BS selection game. Note that if all the BSs are allocated with the same bandwidth 
B s = Vs G S, the fraction of players at each BS is identical, i.e., Vs G S, x s = h. This result is a generalization 
of the one in [5], where similar fractions were obtained for the case where each BS is allocated with the same 
bandwidth and players observe the same channel gains, i.e., V(fc, s) G /C x S, gk, s = 1- 

C. Efficiency of the Nash Equilibria 

Here, we evaluate the performance of the network when a completely decentralized stable configuration is achieved 
(Nash equilibrium) and the performance when there exists a central controller that dictates a configuration which 
maximizes a given global metric. In this study, we consider as global metric, the sum of the utilities of each player, 
i.e., the network spectral efficiency. To carry out such a study we consider two metrics known in the game theory 
jargon as price of anarchy and price of stability [22]. We formally define such parameters as follows: 
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Definition 10: In the strategic game (/C, {P k ) ke:c , ( u k) ke ic), denote the set of NE strategy profiles by V* C V. 
Then, the price of anarchy (PoA) and the price of stability (PoS) are the ratios, 

K 

max^ Ufc (p) 

PoA= ^ , (21) 

min5> fe (p) 

k=l 

and 

K 

max^ Ufe (p) 

PoS= , (22) 

max5> fc (p) 

fe=i 

respectively. 

The discrete nature of the strategy set of the players makes obtaining a closed-form expression for both the PoA and 
PoS, a very difficult task. Thus, we evaluate both PoA and PoS using Monte-Carlo simulations. For instance, in Fig 
5, we respectively plot the PoA and PoS for a network with S G {2, 3} BSs, and K G {1, . . . , 9} players. Therein, 
we observe that the loss due to decentralization is minimum since PoA — > 1, when the number of players increase. 
Similarly to the PoA, the PoS is also close to unity but different from the PoA. This implies that effectively, the 
case where several NE are observed, takes place often. As a concluding remark regarding the efficiency of NE, 
we state that the self-configuring nature of the network does not imply a significant loss of optimality, i.e., if the 
network were centralized by enforcing signaling protocols between all transmitters and the different BSs, the gain 
in network spectral efficiency will not justify the increment of signaling traffic due to the feedback of the optimal 
strategies. 

D. Equilibrium in Mixed Strategies 

For any player k G /C, let the vector q k = (qk.i, ■ ■ ■ > Qk,s) represent a discrete probability distribution over the 
set of pure strategies Vk- Here, q k:S represents the probability of player k transmitting to BS s. The mixed-strategy 
space of player k is the standard simplex Qk'. 

s 

Qk = {(9Mv,ft, s )eK S :^ft, s = 1 . 

s=l 

and Vs G 5, q k , s > 0}, (23) 

and the space of mixed-strategies is Q = Qi x . . . x Qk- Let s = (s%, . . . , sk) be a vector in the discrete set S K . 
Let us also index each element of the set S K with the set {l, . . . , S K } such that elements are ordered following 
the index i G {1, . . . , S K }. Denote by ~ (si \ . . . , s K ^j, the i-th element of such a set S K . Denote by pt? k , 
the vector 

p y -i = (piv--»Pfc_i,Pi!i,---,pjcJ , 
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where for all j G JC, pf' = p max e <o G Vj. In the mixed-strategy extension of the BS selection game Gi, the 
utility function Uk(q), with q = (qr 1 , . . . , q K ) e Q, is defined as the expected value of the corresponding pure 
strategy utilities with respect to the probability distributions q k for all k e /C, i.e. 

S K K 

U k (q k , q_ k ) = E II %, a fUk{pf (24) 
i=\ k=i 

Following Lemma 2.10 in [11], and since, the game Q\ is an exact PG (Prop. 3), we claim the existence of a 
potential function in the mixed-strategy extension of Q\. We denote such a potential by (f>(q), 

S K K 

*)=eii^»* , ' ) )' < 25 > 

i=i fc=i 

where q e Q. From [19], we know that there always exists a NE in mixed strategies for the game Q\. Thus the 
following OP must have at least one solution, 

S K K 

max 

s 



(?fe,a, V (fc,,)eKXS i=l fc=l 



S.t. Vfc e /C, 2 = 1 ) 

s=l 

V(fc, s )eCx5, g s , fc ^ 0. 
However, the solution to the OP (26) might not be necessarily a fully mixed strategy, i.e., a vector q k with more 
than one entry different from zero. Indeed, depending on the channel realizations, it is possible that no NE in fully 
mixed strategies is observed, i.e., 

3(k, s)e)CxS: V<? £ Q, U k {e s ,q_ k ) > U k (q k ,q_ k ). (27) 

For instance, consider the case when there exist two players and two BSs, i.e., K = 2, and 5 = 2. Then, we obtain 
that if there exists a pair (fc, s) e K, x S, such that 

2 

s$ — , (28) 

9k,s V s +Pmax5-fc,s 

the condition (27) always holds and thus, it does not exist a NE in fully mixed strategies. Here, we denote by — s 
and — k the element other than s and k, in the binary sets 5 and IC, respectively. Note that the non-existence of a 
NE in fully mixed strategies does not mean that it does not exist a NE in mixed strategies [19]. The existence of 
at least one NE in pure strategies has been proved (Prop. 4) and a pure strategy NE is also a (degenerated) mixed 
strategy NE. 

There exists several algorithms to iteratively solve the OP (26). Those algorithms are known in the domain 
of machine learning theory as linear reward inaction and linear reward penalty [23]. Several applications of those 
algorithms are presented in [13], [24], [25]. Contrary to the algorithms presented in this paper, linear reward inaction 
and penalty algorithms requires to set up some parameters to refine the convergence speed and the accuracy of the 
obtained probability distributions [23]. These parameters depend on the channel realizations, which means that at 
each coherence time such parameters must be re-adjusted by training. 
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IV. Base Station Sharing Games 

In this section, we consider the case where each player can be associated with several BSs. Here, each player not 
only selects its set of BSs but also the specific power level to transmit to each of its BSs. We define this interaction 
as a strategic game denoted by Q2 = (X, C^iOfcex: > ( u k)keic), where the set JC remains being the indices of each 
player as in the previous section, V represents the space of strategies, where V = V\ x . . . Vk and for all k G JC 

■Pk = {(Pic,i,...,ft,s)eE s :Vse5, p k , s > 0, 

and 

Pk.s ^ J^max ( ■ 

ses J 

The utility function remains being the spectral efficiency of each player as defined by Eq. (5). 

A. Existence and Uniqueness of the NE 

To study the NE of the BS Sharing game Q2, we first introduce the following proposition: 
Proposition 11: The BS sharing game Q2 is an exact potential game with potential function 4>(p) given by Eq. 
(8) for all peP. 

Prop. 1 1 leads us to the following result: 

Proposition 12: In the strategic game Q2 the strategy profile p* = (p*, . . . ,p* K ), with p£ = (p* k ■ ■ ■ ,P% s 
where for all (k, s) E IC x S, 



Pk. 



B s Cfe,. 1 



B Pk 9k,. 



(29) 



is the unique NE of the game. The constant (ik for each player k is set to satisfy the condition ^^Pk,s = Pmax 
and Cfc.s represents the noise plus MAI overcome by player k at BS s. 

Proof: To prove the existence of at least one NE, we use the fact that the BS sharing game Q2 is a PG (Prop. 
1 1). Then, following Corollary 2.2 in [11], the existence of at least one NE is ensured. Thus, proving the uniqueness 
of the NE ends up being equivalent to prove that the OP: 



m e a # Yl it log2 r 7 * + ^ /; < -' / -' - (30) 

P ses \ jes s J 

posses a unique solution. Indeed, since the potential function (f> is strictly concave on V and V is a simplex, and thus 
convex, the Karush-Khun-Tucker (KKT) conditions are necessary and sufficient conditions of optimality. Hence, 
we write: 

' V(k,s)e)CxS, f ( — ^- )- A + ^ = 

VfcG/C, Pk(j2Pk, s -PmJ\ =0 (31) 



\s=l 

V(A;, s) £ /C x S, VkPk,s = 0, 
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The solution to the system of equations (31) is known to be unique and achieved by using the water-filling algorithm 

[2]. Such a solution is given by expression Eq. (29) where (3 k is uniquely determined to satisfy the condition 

s 

/J Pk.s = Pmax, for all k G JC. This ends up the proof. ■ 

s=l 

B. Determination of the NE 

The NE of the BS sharing game C?2 is fully determined by Eq. (29). Here, we study a decentralized algorithm 
such that the NE in Prop. 12 can be achieved by players in a decentralized fashion. First, consider that the strategy 
space V is obtained from the Cartesian product of the closed convex sets Vk, for all k G JC. Second, note that the 
solution to the OP 

max (j> . . . , p k _ 1: x, p k+1 , . . . , p K ) (32) 

for all k G JC and for all p_ k G V-k is unique and can be determined by using the water-filling algorithm [2]. Thus, 
the OP (30) can be solved iteratively by using a non-linear Gauss-Seidel method. Denote by p k (t) the solution to 
p k at iteration t, where t G IN and p k (t + 1) is given by 

aignmx<p(p 1 {t + l),...,p k _ 1 {t + l),p k (t),p k+1 {t),...,p K (t)) . 

Then, following Prop. 2.7.1 in [26], the convergence of the sequence {p k (t)}te{i,...,N} f° r N >> is ensured. 
Based on this result, we introduce the algorithm Alg. 2 for the BS sharing game: 

Algorithm 2 Base Station Sharing Algorithm 
Require: VA; G fC, 

MAI Vector: C fc (0) = (C M (0), . . .,C fc)S (0)) 

Channel Realizations: g k = (gk,i, ■ ■ ■ ,3fe,s)> Vfc G JC 

t *- 0. 

repeat 

t<-t + l 

for k = 1 to K do 

p k (t) <- argmax V log 2 ipi,.~!li. ~ + &, s (t - 1)) 
p k ev k f^ s B 

C k (t) ^t k {t-l) + {p k {t)-p k {t-l))gl 
until p(t) = p(t - 1) 



In Fig. 6 we show the convergence to the maximum of the potential function <fi using Alg. 2 for the case of a 
network with K = 6 transmitters and 5 = 3 BSs. Therein, we show both a round Robin and random updates. In 
both cases, the convergence is achieved in very few iterations. 
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V. Performance Analysis 

In this section, we use algorithms Alg. 1 and Alg. 2 to compare the global performance of the network when 
BS selection and BS sharing are used. We choose the network spectral efficiency as the global performance metric, 
i.e., the sum of the utilities of all players. In fig. 7, we plot the network spectral efficiency for a network with 
S G {2, 4, 8} BSs and K £ {2, ... , 60} transmitters assuming an SNR of 10 dB for each player. We observe that 
when K < S the BS sharing technique performs better than BS selection. However, when K ^ S, the performance 
of the BS selection is strongly superior to BS sharing. For a large number of transmitters both techniques perform 
similarly. 

Note that the strategy space of each player is bigger in the BS sharing scenario. Thus, one can think that a better 
performance is always obtained by using BS sharing than using BS selection. Paradoxically, we have found that on 
the contrary, for nearly fully and fully loaded networks i.e., K ~ S and K > S, increasing the space of strategies 
of each player produces a global loss of performance. A similar paradox is observed in congestion games where 
adding extra capacity to the network ends up reducing the overall performance [17]. A similar paradox to the one 
presented in this work is also observed in [4], [27], [28], [29]. 

VI. Concluding Remarks 

We have investigated the BS selection and BS sharing scenarios in the context of self-configuring networks using 
a non-cooperative model focusing on the spectral efficiency of each transmitter. We have proved the existence of at 
least one NE in both cases. In the BS sharing game a unique NE is observed, whereas BS selection games might 
exhibit several. We have provided fully decentralized algorithms such that players can calculate their NE strategy 
based on local information and the MAI observed at each BS. We have observed that no significant gain would 
be achieved by introducing a central controller in the case of BS selection. The self-configured network performs 
almost identical to the optimally configured network. 

Finally, we have identified that depending on the number of transmitters BS selection might perform better than 
BS sharing. This result implies a Braess type paradox, where increasing the strategy space of each players produces 
a degeneration of the global network spectral efficiency. 

VII. Acknowledgments 
This work was partially supported by Alcatel-Lucent within the Alcatel-Lucent Chair in Flexible Radio at 
SUPELEC. 

References 

[1] W. Yu, W. Rhee, S. Boyd, and J. Cioffi, "Iterative water-filling for Gaussian vector multiple-access channels," IEEE Transactions on 

Information Theory, vol. 50, no. I, pp. 145-152, Jan. 2004. 
[2] T. M. Cover and J. A. Thomas, "Elements of information theory," Wiley-Interscience, 1991. 

[3] T. Alpcan, T. Basar, R. Srikant, and E. Altman, "CDMA uplink power control as a noncooperative game," 40th IEEE Conf. on Decision 
and Control, 2001. 



August 12, 2009 



DRAFT 



16 



[4] E. Altman, V. Kumar, and H. Kameda, "A Braess type paradox in power control over interference channels," Symposium on Modeling and 

Optimization in Mobile, Ad Hoc, and Wireless Networks - WiOPT, Apr. 2008. 
[5] E. Belmega, S. Lasaulce, and M. Debbah, "Decentralized handovers in cellular networks with cognitive terminals," 3rd Intl. Symp. on 

Communications, Control and Signal Processing - ISCCSP, March 2008. 
[6] S. V. Hanly, "An algorithm for combined cell-site selection and power control to maximize cellular spread spectrum capacity," IEEE J. 

Set Areas Comm., vol. 13, no. 7, pp. 1332-1340, Sep. 1995. 
[7] G. J. Foschini and Z. Milijanic, "A simple distributed autonomous power control algorithm and its convergence," IEEE Trans, on Vehicular 

Technology, vol. 42, no. 4, pp. 641-646, 1993. 
[8] C. Singh, A. Kumar, and R. Sundaresan, "Uplink power control and base station association in multichannel cellular networks," GameNets, 

May 2009. 

[9] R. D. Yates, "A framework for uplink power control in cellular radio systems," IEEE J. Set. Areas Comm., vol. 13, no. 7, pp. 1341-1347, 
Sep. 1995. 

[10] P. Mertikopoulos and A. Moustakas, "Correlated anarchy in overlapping wireless networks," IEEE Journal on Selected Areas in 

Communications, vol. 26, no. 7, pp. 1160-1169, September 2008. 
[11] D. Monderer, "Potential games," Games and Economic Behavior, vol. 14, pp. 124-143, 1996. 

[12] G. Scutari, S. Barbarossa, and D. Palomar, "Potential games: A framework for vector power control problems with coupled constraints," 

Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), May 2006. 
[13] Y. Xing and R. Chandramouli, "Stochastic learning solution for distributed discrete power control game in wireless data networks," 

IEEE/ACM Trans. Networking, vol. 16, no. 4, pp. 932-944, 2008. 
[14] M. Liu and Y. Wu, "Spectrum sharing as congestion games," Annual Allerton Conference on Communication, Control, and Computing, 

Sep. 2008. 

[15] P. Coucheney, C. Touati, and B. Gaujal, "Fair and efficient user-network association algorithm for multi-technology wireless networks," 

28th conference on Computer Communications - INFOCOM, 2009. 

[16] , "Selection of efficient pure strategies in allocation games," International Conference on Game Theory for Networks, May 2009. 

[17] D. Braess, "ber ein paradoxon aus der verkehrsplanung," Unternehmensforschung, vol. 24, no. 5, pp. 258 - 268, May 1969. 

[18] J. V. Neumann and O. Morgenstern, "Theory of games and economic behavior," Princeton University Press, 1944. 

[19] J. F. Nash, "Equilibrium points in n-person games," National Academy of Sciences of the United States of America, 1950. 

[20] S. Lasaulce, M. Debbah, and E. Altman, "Methodologies for analyzing equilibria in wireless games," IEEE Signal Processing Magazine, 

Special issue on Game Theory for Signal Processing, Sep. 2009. 
[21] B. Vocking, "Congestion games: Optimization in competition," Second ACiD Workshop. 2006, Durham, UK, September 2006. 
[22] E. Koutsoupias and C. Papadimitriou, "Worst-case equilibria," 16th Annual Symposium on Theoretical Aspects of Computer Science, 1999. 
[23] M. Thathachar and P. Sastry, "Networks of learning automata: Techniques for online stochastic optimization," Springer, 2003. 
[24] P. Sastry, V. Phansalkar, and M. Thathachar, "Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete 

information," IEEE Transactions on Systems, Man and Cybernetics, vol. 24, no. 5, pp. 769-777, May 1994. 
[25] P. Maille and B. Tuffin, "Optimization of transmission power in competitive wireless networks," Lecture notes in computer science., May 

2009. 

[26] D. P. Bertsekas, "Nonlinear programming," Athena Scientific, 1995. 

[27] J. E. Cohen and C. Jeffries, "Congestion resulting from increased capacity in single-server queueing networks," IEEE/ACM Trans, on 

Networking., vol. 5, no. 2, pp. 305-310, 1997. 
[28] Y. Korilis, A. Lazar, and A. Orda, "Architecting noncooperative networks," IEEE Journal on Selected Areas in Communications, vol. 13, 

no. 7, pp. 1241-1251, Sep 1995. 

[29] E. Altman, T. Jimenez, N. Vicuna, and R. Marquez, "Coordination games over collision channels," 6th Intl Symp. on Modeling and 
Optimization in Mobile, Ad Hoc, and Wireless Networks - WiOPT, April 2008. 



August 12, 2009 



DRAFT 



17 




(a) (6) 

Fig. 1. (a) Non-oriented graph and (b) oriented graph representing the BS Selection game with K = 3, S = 2, under the condition 
</>(p( 2 >) > </>(p( 6 >) > <Mp (1) ) > <Hp (5) ) > <MP (4) ) > <MP (7) ) > <Mp (8) ) > <KP (3) )- Total number of vertices: S K = 8, number of 
neighbors per vertex: K(S — 1) = 3. Maximum Number of NE: S* -1 = 4. Number of NE: 2 (red vertices in (b)). 



8r 




Steps 



Fig. 2. Evolution of the potential at each update of the players using the BS Selection Alg. 1. At each step one player is randomly chosen 
to update its strategy. All the sequences are obtained using the same set of channel realizations s , \/(k,s) £ K X S. Number of players 
K = 5, Number of BSs S = 3, S± = Q.14, = 0.40, and = 0.46. SNR = 101og 10 (7^) = 10 dB. 
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Fig. 3. Potential associated with each strategy profile pW as a function of i € X. The set of channel realizations h^.s, V(fe, s) 6 £ X 5 is 
identical to that one used in Fig. 2. Nash Equilibria are pointed by arrows. Number of players K = 5, Number of BSs 5 = 3, ^ = 0.14, 
^ = 0.40, and 5a = 0.46. SAT.R = 101og 10 (^g-) = 10 dB. 
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Fig. 4. Fraction of players transmitting to BS s, with s £ S, calculated using Monte-Carlo simulations and using Eq. (20) for a network with 
5 = 6 BSs and K = 100 transmitters, with a = (a s ) VsG S = (0.25,0.11,0.20,0.05,0.25,0.14). SNR = 101og 10 (^pf) = 10 dB. 
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Fig. 5. Price of anarchy (Def. 10) and price of stability (Def. 10)as a function of the number of players K for the case of two and three BSs, 
S = 2 and 5 = 3. SNR = 101og 10 (^pr) = 10 dB. 
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Fig. 7. Sum spectral efficiency for the BS sharing and BS Selection algorithms as a function of the number of players present in the network. 
Players operate at a transmit SNR 10 log 10 \^§) = 10 dB and the fractions ^ = |, Vs e S. SNR = 101og 10 (f^g-) = 10 dB. 
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